Search CORE

5 research outputs found

Mining Frequent Itemsets Using Genetic Algorithm

Author: Biswas Sushanta
Ghosh Soumadip
Sarkar Debasree
Sarkar Partha Pratim
Publication venue: 'Academy and Industry Research Collaboration Center (AIRCC)'
Publication date: 01/11/2010
Field of study

In general frequent itemsets are generated from large data sets by applying association rule mining algorithms like Apriori, Partition, Pincer-Search, Incremental, Border algorithm etc., which take too much computer time to compute all the frequent itemsets. By using Genetic Algorithm (GA) we can improve the scenario. The major advantage of using GA in the discovery of frequent itemsets is that they perform global search and its time complexity is less compared to other algorithms as the genetic algorithm is based on the greedy approach. The main aim of this paper is to find all the frequent itemsets from given data sets using genetic algorithm

arXiv.org e-Print Archive

CiteSeerX

Crossref

NOVEL GRAY SCALE CONVERSION TECHNIQUES BASED ON PIXEL DEPTH

Author: Amitava Nag
Anjan Pal
Arindrajit Pal
Debasish Biswas
Snehasish Banerjee
Soumadip Ghosh
Sushanta Biswas
Publication venue
Publication date: 06/03/2020
Field of study

CiteSeerX

A novel Neuro-fuzzy classification technique for data mining

Author: Biswas Sushanta
Ghosh Soumadip
Sarkar Debasree
Sarkar Partha Pratim
Publication venue: Production and hosting by Elsevier B.V.
Publication date: 01/11/2014
Field of study

AbstractIn our study, we proposed a novel Neuro-fuzzy classification technique for data mining. The inputs to the Neuro-fuzzy classification system were fuzzified by applying generalized bell-shaped membership function. The proposed method utilized a fuzzification matrix in which the input patterns were associated with a degree of membership to different classes. Based on the value of degree of membership a pattern would be attributed to a specific category or class. We applied our method to ten benchmark data sets from the UCI machine learning repository for classification. Our objective was to analyze the proposed method and, therefore compare its performance with two powerful supervised classification algorithms Radial Basis Function Neural Network (RBFNN) and Adaptive Neuro-fuzzy Inference System (ANFIS). We assessed the performance of these classification methods in terms of different performance measures such as accuracy, root-mean-square error, kappa statistic, true positive rate, false positive rate, precision, recall, and f-measure. In every aspect the proposed method proved to be superior to RBFNN and ANFIS algorithms

Elsevier - Publisher Connector

Directory of Open Access Journals

An Efficient Computational Risk Prediction Model of Heart Diseases Based on Dual-Stage Stacked Machine Learning Approaches

Author: Amitava Nag
Ranjan Maity
Soumadip Ghosh
Subhash Mondal
Yachang Omo
Publication venue: IEEE
Publication date: 01/01/2024
Field of study

Cardiovascular diseases (CVDs) continue to be a prominent cause of global mortality, necessitating the development of effective risk prediction models to combat the rise in heart disease (HD) mortality rates. This work presents a novel dual-stage stacked machine learning (ML) based computational risk prediction model for cardiac disorders. Leveraging a dataset that includes eleven significant characteristics from 1190 patients from five distinct sources, five ML classifiers are utilized to create the initial prediction model. To ensure robustness and generalizability, the classifiers are cross-validated ten times. The model performance is optimized by employing two hyperparameter tuning approaches: RandomizedSearchCV and GridSearchCV. These methods aim to find the optimal estimator values. The highest-performing models, specifically Random Forest, Extreme Gradient Boost, and Decision Tree undergo additional refinement using a stacking ensemble technique. The stacking model, which leverages the capabilities of the three models, attains a remarkable accuracy rate of 96%, a recall value of 0.98, and a ROC-AUC score of 0.96. Notably, the rate of false-negative results is below 1%, demonstrating a high level of accuracy and a non-overfitted model. To evaluate the model’s stability and repeatability, a comparable dataset consisting of 1000 occurrences is employed. The model consistently achieves an accuracy of 96.88% under identical experimental settings. This highlights the strength and dependability of the suggested computer model for predicting the risk of cardiac illnesses. The outcomes indicate that employing this two-step stacking ML method shows potential for prompt and precise diagnosis, hence aiding the worldwide endeavor to decrease fatalities caused by heart disease

Directory of Open Access Journals

Breast Cancer Detection by T-Shaped Slotted Planner Antenna

Author: Adnan S Abd-Alhameed RA, Hraga HI, Elfergani ITE,Noras JM and Halliwell R.
Alexopoulus NG.
Balinies C.
David M Pozar and Daniel H Schaubert.
Deschamps GA.
Mahalakshmi N and Vijay Jeyakumar.
Radouane Karli Hassan Ammor , Jamal El Aoufi.
Radoune Karli Hassan Ammor.
Saleh Alshehri.
Sangam Kumar Singh and Arun Kumar Singh.
Sarva E Sridevi PV.
Seladji-Hassaine N Merad L, Meriah SM and Bendimerad FT.
Soumadip Ghosh Sushanta Biswas, Debasree Chanda Sarkar, Partha Pratim Sarkar.
Sudhir Shrestha Mangilal Agarwal, Joshua Reid and Kody Varahramyan.
Sudhir Shrestha Mangilal Agarwal, Parvin Ghane and Kody Varahramyan.
Yu YJ Yuan M and Liu QH.
Publication venue: 'Indian Society for Education and Environment'
Publication date
Field of study

Crossref